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Abstract 

Tbe theory of belief functions manages uncertainty and also proposes a set of com¬ 
bination rules to aggregate opinions of several sources. Some combination rules 
mix evidential information where sources are independent; other rules are suited to 
combine evidential information held by dependent sources. In this paper we have 
two main contributions: First we suggest a method to quantify sources’ degree of 
independence that may guide the choice of the more appropriate set of combina¬ 
tion rules. Second, we propose a new combination rule that takes consideration of 
sources’ degree of independence. The proposed method is illustrated on generated 
mass functions. 

Keywords: Theory of belief functions. Combination rules. Clustering, 
Independence, Sources independence. Combination rule choice 


1. Introduction 


Uncertainty theories like the theory of probabilities, the theory fuzzy sets 


I ID, the theory of possibilities yD and the theory of belief functions model 


and manage uncertain data. The theory of belief functions can deal with imprecise 
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and/or uncertain data provided by several belief holders and also combine them. 
Combining several evidential information held by distinct belief holders ag¬ 


gregates their points of view by stressing common points. In the ther^ of be 


functions, many combination rules are proposed, some of them like 


7 


ief 

21 


are fitted to the aggregation of evidential information provided by cognitively inde¬ 




pendent sources whereas the cautious, bold HOD and mean combination rules can 
be applied when sources are cognitively dependent. The choice of combination 
rules depends on sources independence. 

Some researches are focused on doxastic independence of variables such as 

~~ ar 


m IL11211 : others 


13D tackled cognitive and evidential independence of vari¬ 


ables. This paper is focused on measuring the independence of sources and not 
that of variables. We suggest a statistical approach to estimate the independence 
of sources on the bases of all evidential information that they provide. The aim of 
estimating the independence of sources is to guide the choice of the combination 
rule to be used when combining their evidential information. 

We propose also a new combination rule to aggregate evidential information 
and take into account the independence degree of their sources. The proposed 
combination rule is weighted with that degree of independence leading to the con¬ 
junctive rule 114] when sources are fully independent and to the cautious rule fl] 


when they are fully dependent. 

In the sequel, we introduce in Section 2 preliminaries of the theory of belief 
functions. In the Section 3, an evidential clustering algorithm is detailed. This clus¬ 
tering algorithm will be used in the first step of the independence measure process. 
Independence measure is then detailed in Section 4. It is estimated in four steps: In 
the first step the clustering algorithm is applied. Second a mapping between clus¬ 
ters is performed; then independence of clusters and sources are deduced in the last 
two steps. Independence is learned for only two sources and then generalized for 
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a greater number of sources. A new combination rule is proposed in the Section 5 
taking into account the independence degree of sources. The proposed method is 
tested on random mass functions in Section 6. Finally, conclusions are drawn. 


2. Theory of belief functions 


The theory of belief functions was introduced by Dempster yl] and formal¬ 
ized by Shafer to model imperfect data. The. frame of discernment also called 
universe of discourse, Tl = {(01,(02,... ,( 0 n }, is an exhaustive set of N mutually 
exclusive hypotheses cOj. The power set 2^ is a set of all subsets of fl; it is made of 
hypotheses and unions of hypotheses from Q.. The basic belief assignment fBBAj 
commonly called mass function is a function defined on the power set 2^ and spans 
the interval [0,1] such that: 

^ m{A) = 1 (1) 

ACQ. 

A basic belief mass fBBMj also called mass, m{A), is a degree of faith on the truth 
of A. The BBM, m(A), is a degree of belief on A which can be committed to its 
subsets if further information justifies if [Tl]. 

Subsefs A having a sfricfly posifive mass are called/oca/ elements. Union of all 
focal elemenfs is called core. Shafer assumed a normalify condifion such fhaf 
m(0) = 0, Ihereafler Smets Qi relaxed fhis condifion in order fo folerafe mfb) > 0. 

The frame of discernmenf can also be a focal elemenf; ifs BBM, m{Q.), is infer- 
prefed as a degree of ignorance. In fhe case of total ignorance, m{Q.) = 1. 

A simple support function is a mass funclion wifh fwo focal elemenfs including 
fhe frame of discernmenf. A simple supporf funclion m is defined as follows: 


m(A) = < 


1 — w // A = B for some B CQ. 
w if A = 

0 ofherwise 


( 2 ) 
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Where A is a focus of that simple support function and w € [0,1] is its weight. A 
simple support function is simply noted A'^. A nondogmatic mass function can 
be obtained by the combination of several simple support functions. Therefore, 
any nondogmatic mass function can be decomposed into several support functions 


using the canonical decomposition proposed by Smets jlSll . 


The belief function {bel) is computed from a BBA m. The amount bel{A) is the 
minimal belief on A justified by available information on B {B CL A): 


bel{A)= £ m{B) (3) 

The plausibility function {pi) is also derived from a BBA m. The amount pl{A) 
is the maximal belief on A justified by informafion on B which are nof confradicfory 
wifh A (AnB / 0): 

pK^) = L 

Ans/e 

Pignisfic fransformafion compufes pignisfic probabilifies from mass funcfions 
in fhe purpose of making a decision. The pignisfic probabilify of a single hypofhe- 
sis A is given by: 


BetP{A) 


Bcn,B^0 


|BnA| 

1 ^ 


m{B) 

1 — m(0) 


( 5 ) 


Decision is made according fo fhe maximum pignisfic probabilify. The single poinf 
having fhe greafesf BetP is fhe mosf likely hypofhesis. 


2.1. Discounting 

Sources of information are nof always reliable, fhey can be unreliable or even a 
little bif reliable. Taking info accounf reliabilify of sources, we adjusf fheir beliefs 
proportionally fo degrees of reliabilify. Discounfing mass funcfions is a way of 
faking consideration of sources’ reliabilifies info fheir mass funcfions. If reliabilify 
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rate a of a source is known or can be quantified; discounting its mass function m 
is defined as follows: 


m^{A) =axm{A) ,VAcfl 

= 1 — a X (1 — 


( 6 ) 


This discounting operator can be used nol only fo lake consideralion of source’s 
reliability, buf also fo consider any informalion which can be infegrafed into fhe 
mass function, (1 — a) is called discounting rate. 


2.2. Combination rules 


In fhe fheory of belief funcfions, a greal number of combination rules are used 
to summarize a sel of mass functions info only one. Lei and ^2 be Iwo disfincl 
and cognitively independenl sources providing Iwo differenl mass functions mi and 
m 2 defined on fhe same frame of discernmenl fl. Combining Ihese mass functions 
induces a Ihird one mi 2 defined on fhe same frame of discernment 


Iblll 


a. 


but we enu- 


There is a great number of combination rules yl 
merate in this section only Dempster, conjunctive, disjunctive. Yager, Dubois and 
Prade, mean, cautious and bold combination rules. The first combination rule was 
proposed by Dempster in yl] to combine two distinct mass functions mi and m 2 as 
follows: 

^ mi{B)xm2{C) 

BnC=A _ C Q..^ A^ib 

mi02(A) = {mi 0 ^ 2 )(A) = ^ mi{B)xm2{C) 

Bnc=0 


0 


ifA = % 


The BBM of the empty set is null (m(0) = 0). This rule verifies fhe normality con- 
dilion and works under a closed world where Q. is exhausfive. 


In order fo solve fhe problem highlighfed by Zadeh’s counler example lll6l] 
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where Dempster’s rule of combination produced unsatisfactory results, many com¬ 


bination rules appeared. Smets il4|] proposed an open world where a positive mass 


can be allocated to the empty set. Hence the conjunctive rule of combination for 
two mass functions m\ and m 2 is defined as follows: 


m\(Q)2{A) = {mx@m2){A)= ^ mi(B)xm2(C) 

Bnc=A 


( 8 ) 


Even if Smets jlTII interpreted the BBM, ^102(0), as an amount of conflict be¬ 


tween evidences that induced m\ and m 2 ; that amount is not really a conflict be¬ 
cause it includes a certain degree of auto-conflict due to the non-idempotence of 


the conjunctive combination jlSll . 


The conjunctive rule is used only when both sources are reliable. Smets 11411 


proposed also to use a disjunctive combination when an unknown source is unre¬ 
liable. The disjunctive rule of combination is defined for two BBAs m\ and m 2 as 
follows: 

mi(y) 2 (A) = (mi(y)m 2 )(A) = ^ m\{B)xm 2 {C) (9) 

j-, BUC=A 

Yager in interpreted m(0) as an amount of ignorance; consequently it is 
allocated to Q.. Yager’s rule of combination is also defined to combine two mass 
functions m\ and m 2 as follows: 


my(Y) = mi(n) 2 (Y) VX C D, X / 0 

mr(D) =mi(n)2(^^) + '«i(n)2(0) 
my {lb) = 0 


( 10 ) 


Dubois and Prade’s solution OD was to affect the mass resulting from the combina¬ 
tion of conflicting focal elements to the union of these subsets: 


mDp{B) = mi(^2{I^) + ^ mi(X)m 2 (A) VA C D, A 7 ^ 0 

Anx=0, AUZ=B (11) 


tnDp{lb) = 0 


6 












Conjunctive, disjunctive and Dempster’s rules are associative and commutative, 
but Yager and Dubois and Prade’s rules are not associative, even if they are com¬ 
mutative. Unfortunately, all combination rules described above are not idempotent 
because m@m ^ m and m@m ^ m. 


Mean combination rule detailed in 


tnMean, of two mass functions mi and 


m 2 is the average of these ones. Therefore, for each focal element A of M mass 
functions, the combined one is defined as follows: 


M 


mMean ^ ^ "h (A) 


( 12 ) 


!=1 


Besides idempotence, this combination rule verifies normalify condifion (m(0) = 0) 
if combined mass functions are normalized (V/ G M, m, (0) = 0). We note also fhaf 
fhis combination rule is commutative but not associative. 

All combination rules described above work under a strong assumption of cog¬ 
nitive independence since they are used to combine mass functions induced by 
two distinct sources. This strong assumption is always assumed but never verified. 


H, 


Denoeux IIIOI] . proposed a family of conjuncfive and disjuncfive rules based on fri- 
angular norms and conorms. Cautious and bold rules are members of fhaf family 
and combine mass functions for which independence assumption is nol verified. 
Cautious combination of fwo mass functions m\ and m 2 issued from probably de- 
pendenf sources is defined as follows: 




(13) 


Where A"'! and A'^^D) simple support functions focused on A wifh weighfs 


wi and W 2 issued from fhe canonical decomposifion lll5h of mi and m 2 respectively, 


nofe also fhaf A is a min operator of simple supporf functions weighfs. The bold 
and caufious combinafion rules are commufafive, associafive and idempofenf. 

To summarize, the choice of the combination rule is based on the dependence 
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of sources. Combination rules like H, y, la LZL [Sfl combine mass functions which 
sources are independent, whereas cautious, bold and mean rules are the most fitted 
to combine mass functions issued from dependent sources. 

In this paper, we propose a method to quantify sources’ degrees of indepen¬ 
dence that may be used in a new mixed combination rule. In fact, we propose a 
statistical approach to learn sources’ degrees of independence from all provided 
evidential information. Indeed, two sets of evidential information assessed by two 
different sources are classified info fwo sefs of clusfers. Clusfers of bofh sources 
are mafched and fhe independence of each couple of mafched clusfers is quanti¬ 
fied in order fo esfimafe sources’ degrees of independence. Therefore, a clusfering 
fechnique is used fo gafher similar objecfs info fhe same clusfer in order fo sfudy 
fhe source’s overall behavior. Before infroducing our learning mefhod, we defail in 
fhe nexf section fhe evidential clusfering algorifhm fhaf will be used in fhe learning 
of sources’ degrees of independence. 


3. Evidential clustering 


In this paper, we propose a new clustering technique to classify objects; their 
attributes values are evidential and classes are unknown. Proposed clustering al¬ 
gorithm uses a distance on belief functions given by Jousselme et al. H such as 
proposed by Ben Hariz et al. H. 

Ben Hariz et al. B detailed a belief .K'-modes classifier in which Jousselme 
distance Q] is adapted to quantify distances between objects and clusters modes. 
These are sets of mass functions; each one is the combination of an attribute’s val¬ 
ues of all objects classified into that cluster. An object is attributed to the cluster 
having the minimum distance to its mode. 


Temporal complexity of clustering algorithm proposed by Ben Hariz et al. ll20ll 













is quite high as clusters modes and distances are computed in each iteration. The 
combination by the mean rule to compute modes values leads to mass functions 
with a high number of focal elements. Hence, the bigger the cluster is, the least 
significant is the distance. 

We propose a clustering technique to classify objects that attributes values are 
uncertain. However uncertainty is modeled with the theory of belief functions de¬ 
tailed in Section |2] In the proposed algorithm, we do not use any cluster mode to 
avoid the growth of focal elements number in clusters modes. Temporal complex¬ 
ity is also significantly reduced because all distances are computed only once. 

In this section, K is the number of clusters Clk (1 < ^ < K)-, n is the number 
of objects to be classified; is fhe number of objecfs classified info clusfer C4; 
Oi are objecfs fo classify o, : 1 < / < n; c is fhe number of evidenfial affribufes 
: 1 < 7 < c which domains are Q.a- and finally niij is a mass function value 
of affribufe for objecf Mass funcfions can be cerfain, probabilistic, 

possibilisfic, evidenfial and even missing. 

To classify objecfs o,- info K clusfers, we use a clusfering algorifhm wifh a dis- 
fance on belief funcfions given by . The number of clusfers K is assumed fo be 
known. Proposed clusfering fechnique is based on a disfance which quanfifies how 
much is far an objecf o, from a clusfer C4- This disfance is fhe mean of disfances 
befween o,-, and all objecfs Oq fhaf are classified info clusfer C4 as follows: 

I nk 

D{oi,Clk) = — y dist{oi,Oq) (14) 


and 


wifh : 


dist{oi,Og) 


-Y,d{mij,mqj) 


d{mij,mqj) 



-niqjyainiij 


mqj) 


(15) 


(16) 
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such that: 


D(A,fi) = 


1 ifA = B = 0 

|AnB| wx I, _ 

IauM 


(17) 


Each object is affected to the most similar cluster in an iterative way till reach¬ 
ing an unchanged cluster partition. It is obvious that clusters number K must be 
known. Temporal complexity of the proposed algorithm is significantly optimized 
as pairwise distances are computed once a time from the beginning. We do not use 
any cluster mode. Consequently, there will be no problem of increasing number of 
focal elements because attributes values are not combined. Indeed, the evidential 
clustering algorithm provides a cluster partition that minimizes distances between 
objects into the same cluster and maximizes the distance between objects classified 
info differenf clusfers. The main assef of fhe evidential clustering algorifhm accord¬ 


ing fo fhe belief A'-modes proposed by Ben Hariz el al. 12011 is fhe opfimizalion of 


fhe lemporal complexify. In facf, run-lime of fhe evidenlial clusfering algorifhm 
is improved. The opfimizalion of run-fime depends on fhe size of fhe frame of 
discernmenl fhe number of clusfers K and number of objecls n. For exam¬ 

ple, figure [U shows a big gain in fhe run-lime of evidenlial clustering according 
lo fhe belief A'-modes when fhe number of mass functions varies, n G [10,1000]. 
Temporal complexify of fhe evidential clustering algorifhm is oplimized and lhal 
opfimizalion is especially noficed when fhe number of mass funcfions fo classify is 
high and also when fhe frame of discernmenl conlains many hypolheses. Thanks 
fo fhe improve of fhe lemporal complexify. Ibis clusfering algorifhm is used in fhe 
following sections. 


4. Learning sources independence degree 


In Ihis secfion we extend paper 112 ih for many sources, and propose a com¬ 


bination rule emphasizing sources independence degree. In fhe Iheory of prob- 
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Figure 1: Run-time optimization of the evidential clustering and the belief A'-modes 


H 


according 


to u e [10,1000], I Q.aj 1= 5 and K = 5 


abilities, two hypotheses X and Y are assumed to be statistically independent if 
P{X ny) = P{X) X P{Y) or /’(X|F) = P{X). In the context of the theory of belief 
functions, Shafer defined cognitive and evidential independence. 

Definition 1. “Two frames of discernment may be called cognitively independent 
with respect to the evidence if new evidence that bears on only one of them will 
not change the degree of support for propositions discerned by the other” 3- 

The cognitive independence is a weak independence', two variables are inde¬ 
pendent with respect to a mass function if new evidence that bears on only one 
of the two variables does not change propositions discerned by the other one. For 
two variables X and Y such that Q.x and fly their domains (frames of discernment) 
and Q.X x fly the product space of domains fix and fly. Variables X and Y are 
cognitively independent with respect to if: 


plQ.xxQ.y ^x,y) ^ X pfixxQ.riQ.r 


( 18 ) 


page 149 
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Note that Q.x x Q.y i Q-x is the marginalization of Q.x x Qy in Qx 117112211 . 

Shafer 1^ defined also a strong independence called evidential independence as 


follows: 


Definition 2. “Two frames of discernment are evidentially independent with re¬ 
spect to a support function if that support function could be obtained by combining 
evidence that bears on only one of them with evidence that bears on only the other”. 


Two variables are evidentially independent if their joint mass function can be 
obtained by combining marginal mass functions that bears on each one of them. 
Variables X and Y are evidentially independent with respect to if: 

( plClxXny ^ X pl^X xClyiay 

^ {x,y) = (x) x (y) 

Independence can also be defined in terms of irrelevance. The knowledge of 
the value of one variable does not change the belief on the other one. In the theory 
of belief functions, irrelevance is based on the conditioning. Variables X and Y 
are irrelevant with respect to m, IRm{X,Y) if the marginal mass function on X is 
obtained by conditioning the joint mass function on values y of T and marginalizing 
this conditioned joint mass function on X: 


^^x^Q.y\.ax oc 


( 20 ) 


Note that proportionality oc is replaced by equality when ^Q.x>^€iyiQ.x 

are normalized. 

Doxastic independence is especially proposed in the theory of belief functions 
by 1 IIIL I 12 I 1 and it is defined as follows: 

Definition 3. “Two variables are considered as doxastically independent only when 
they are irrelevant and this irrelevance is preserved under Dempster’s rules of com¬ 
bination”. 
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In other words, two variables X and Y are doxastically independent if they are 
irrelevant with respect to m © mo when they are irrelevant with respect to m and tuq. 
Indeed, if X and Y are irrelevant according to any mass function m and if they are 
also irrelevant with respect to another mass function mo; they are assumed to be 
doxastically independent if they are irrelevant with respect to the orthogonal sum 
of m and mo . Thus, if IRm{X,Y), IRmo{X,Y) and IRm^mciXiY) is verified then X 
and Y are doxastically independent. 

This paper is not focused on variables independence jllLll2L jj] but on sources 
independence. Sources independence is computed according to a set of different 
belief functions provided by each source separately. Sources are dependent when 
all their beliefs are correlated, there is a link between all mass functions they pro¬ 
vide. This problem is not tackled till now, we noticed a lack of references treating 
this problem. To study sources independence, a great number of mass functions 
provided by both sources is needed. This set of mass functions must be defined on 
the same frame of discernment according to the same problems. For example, two 
distinct doctors provide n diagnoses in the examination of the same n patients. In 
that case, the frame of discernment contains all diseases and is already the same 
for both doctors. We define sources independence as follows: 


Definition 4. Two sources are cognitively independent if they do not communicate 
and if their evidential corpora are different. 

Definition 5. Evidential corpus is the set of all pieces of evidence held by a source. 


Not only communicating sources are considered dependent but also sources 
having the same background of knowledge since their beliefs are correlated. The 
aim of estimating sources independence is either to guide the choice of combina¬ 
tion rules when aggregating their beliefs, or to integrate this degree of indepen¬ 
dence in a new combination rule. 
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In this paper, mass functions provided by two sources are studied in order to 
reveal any dependence between them. In the following, we define an independence 
measure Id, (Id(‘5l,52))^ as the independence of on S 2 verifying the following 
axioms: 

1. Non-negativity: The independence of a source on another source ^ 2 , Id(‘S'i, i' 2 ) 
cannot be negative, it is either positive or null. 

2. Normalization: The degree of independence Id is a degree over [0,1], it is 
null when the first source is dependent on the second one, equal to 1 when it 
is completely independent and a degree from [0,1] otherwise. 

3. Non-symmetry: In the case where is independent on S 2 , S 2 is not necessar¬ 
ily independent on ^i. Even if and $2 are mutually independent, degrees 
of independence are not necessarily equal. 

4. Identity: Any source is completely dependent on itself and Id(5i,5i) = 0. 

If and ^2 are independent, there will be no correlation between their mass func¬ 
tions. The main idea of this paper is: First, classify mass functions provided by 
each source separately. Then, study similarities between cluster partitions to reveal 
any dependence between sources. By using clustering algorithm, sources overall 
behavior is studied. The proposed method is in three steps: First, mass functions 
of each source are classified. Then, similar clusters are matched. Finally, weights 
of linked clusters and sources independence are quantified. 

4.1. Clustering 

Clustering algorithm detailed in Section [3] is used to classify two sets of n 
mass functions respectively provided by sources and S 2 - Clustering algorithm is 
performed on all mass functions of independently of the clustering performed 
on those of S 2 . We remind that all mass functions of both sources are defined on the 
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same frame of discernment and so considered as values of only one attribute when 
classifying their corresponding objects. For the same example of doctors, patients 
are objects to classify according to an attribute disease. Values of this attribute 
are mass functions defined on the frame of discernment enumerating all possible 
diseases. Distance ([14]) can be simplified as follows because we have only one 
affribufe: 

I % 

D{oi,Clk) = — 52 d{mi,mq) ( 21 ) 

q=\ 

In Ibis paper, we fix fhe number of clusfers fo fhe number of hypofheses in fhe 
frame of discernmenf. In a classification poinf of view, number of hypofheses is 
fhe number of possible classes. For example, fhe frame of discernmenf of fhe 
affribufe disease enumerafes all possible diseases. Hence, when a doctor examines 
a pafienf, he gives a mass funclion as a classification of fhe pafienf in some possible 
diseases. 

4.2. Cluster matching 

After clustering fechnique, bofh mass funclions provided by and S 2 are dis- 
fribufed separafely on K clusfers. In fhis secfion, we fry fo find a mapping befween 
clusfers in order fo link fhose confaining fhe same objecfs. If clusfers are perfecfly 
linked, meaning all objecfs are classified similarly for bofh sources, we can con¬ 
clude fhaf sources are dependenf as fhey are choosing similar focal elemenfs (nol 
confradicfory af leasf) when providing mass funclions for same objecfs. If clusters 
are weakly linked, sources choose similar focal elemenfs for differenl objecfs and 
so fhey are independenf. Clusters independence degree is proportional to fhe num¬ 
ber of objecfs similarly classified. More clusfers conlain fhe same objecfs, more 
fhey are dependenf as fhey are correlated. 

We note Cll^ where 1 < < .S' for clusters of and where I <k 2 < K 

for fhose of S 2 . The similarify befween Iwo clusfers Cll^ and is fhe proporlion 
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of objects simultaneously classified into Cll^ and 


. . ■ ic/incz/. 

PI =l5‘{cil,cii)= 


cii 


( 22 ) 


with i^j G {1,2)^ and i ^ j. Pkik2 quantifies a proportion of objecfs classified 


simulfaneously in clusfers and CZ|^ wifh regard fo objecfs in CZ^^, analogically 


ki 


Pk2ki is a proportion of objecfs simulfaneously in CZ^^ and CZ^^ wifh regard fo fhose 
in CZ^^. Note fhaf Pkik2 ^ Pk 2 ki since fhe number of objecfs classified info CZ^^ and 
CZf^ are differenf (| CZ/^ |/| CZf^ |). 

We remind fhaf are similarities fowards and p^ are fhose fowards ^ 2 . It 
is obvious that j3'(CZj,_,CZ^ ) = 0 when CZj,. and Cll. do not contain any common 
object; however they are completely different. j8'(CZ^ ,CZ^.) = 1 when these clus¬ 
ters are strongly similar so they contain the same objects. A similarity matrix M\ 
containing similarities of clusters of according to those of S 2 iP^), and M 2 the 
similarity matrix between clusters of S 2 and those of (p^) are defined as follows: 




P12 ■ 

• PIk 


^PL 

PI2 ••• 

PL^ 

Ml = 

PL 

PL • 

•• pIk 

and M 2 = 

PL 

PL ••• 

PIk 




fi.L. , 



R2 

ft 2 _, , 


(23) 


We note fhaf Mi and M 2 are differenf since Pkik2 ^ ^hki- Clusters of are 
mafched fo fhose of S 2 according fo maximum of p^ such fhaf each cluster CZ^^ is 
linked fo only one clusfer CZ|^ and each clusfer CZ^^ has only one cluster CZ^^ linked 
fo if. The idea is fo link iferafively clusfers having fhe maximal jS Mn Mi fhen elim- 
inafe fhese clusfers and fhe corresponding line and column from fhe mafrix unfil 
having a bijecfive clusfer mafching. Algorifhm[T] defails cluster mafching process. 
We note fhaf differenf mafchings are obfained for ^i and S 2 because Mi and M 2 
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are different. This algorithm is iterative and the number of iteration is equal to the 


Algorithm 1 Cluster matching 
Require: Similarity matrix M . 

1: while M is not empty do 

2: Find max{M) and indexes c and I of clusters having this maximal similarity. 

3: Map clusters I and c. 

4: Delete line I and column c from M. 

5: end while 

6: return Cluster matching. 


number of clusters K. Even if this algorithm is quite simple, it provides a matching 
of clusters in order to compare evidential information provided by both sources. 


The assignment algorithm proposed in 1231] for square matrices and that for rectan¬ 
gular matrices Q can also be used to minimize the dissimilarity between matched 
clusters. Other methods for cluster matching ll25ll and 12611 can also be used. 


4.3. Cluster independence 

Once cluster matching is obtained, a degree of independence/dependence of 
matched clusters is quantified in this step. A set of matched clusters is obtained for 
both sources and a mass function can be used to quantify each couple of clusters 
independence. Assume that cluster is matched to CZ|^, a mass function 
defined on fhe frame of discernment Q.j = {Dependent Dep,Independent Ind} 


^We note the frame of discernment in the mass functions to avoid confusion. 


17 









describes how much this couple of clusters is independent or dependent as follows: 

^kkS^ep) = 

-Kk,) <^ 24 ) 

mf'f^\Dep(Jlnd) = 1 — a^. 

A mass function quantifies the degree of independence of each couple of clusters 
according to each source; ' is a mass function for the independence of each 
linked clusters Cl[. and CZ^ according to Si with i,j G {1,2} and i ^ j. Coefficient 
is used to take into account of number of mass functions in each cluster CZj,. of 
the source i. Reliability factor is not the reliability of any source but it can be 
seen as the reliability of the clusters independence estimation. Consequently, inde¬ 
pendence estimation is more reliable when clusters contain enough mass functions. 
For example, assume two clusters; one containing only one mass function and the 
second one containing 100 mass functions. It is obvious that the independence esti¬ 
mation of the second cluster is more precise and significant than the independence 
estimation of the first one. 

Reliability factors are proportional to the number of hypotheses in the frame of 
discernment | |, and the number of objects classified in CZ{. as follows: 

ai=fi\Q\,\Cll\) (25) 

The bigger | | is, fhe more mass functions are needed fo have a reliable cluster 

independence esfimafion. For example, if | | = 5 fhen fhere are 2^ possible focal 

elemenfs, also independence esfimafion of a cluster confaining 20 objecfs cannof 
be precise. No exisfing mefhod fo define such function /. Hence, we use simple 
heuristics as follows: 

4 . = 1 -^ ( 26 ) 

I I 
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Reliability factor 


As shown in figure |2j if | H | and number of mass functions in a cluster are big 
enough, cluster independence mass function is almost not discounted. Reliability 
factor is an increasing function of | | and | CZ[. | which favors big clusterj^ 



Figure 2: Reliability factors 


4.4. Sources independence 

Obtained mass functions quantify each matched clusters independence accord¬ 
ing to each source. Therefore, K mass functions are obtained for each source such 
that each mass function quantifies the independence of each couple of matched 
clusters. The combination of K mass functions for each source using the mean, 
defined by equation (fT^ . is a mass function m^‘ defining the whole independence 
of one source on another one: 

(A) = 1 £ mf'/AA) VA C 2^ (27) 

k,=i 

With kj is the cluster matched to kt according to Sj. Two different mass func¬ 
tions and are obtained for and S 2 respectively. We note that 

^Big clusters are those containing enough mass functions according to | |. 
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is the combination of K mass functions representing the independence of matched 
clusters according to defined using equation (l24b . Mass functions and 

ffP-hS 2 are different since cluster matchings are different which verifies the axiom of 
non-symmetry. Pkik 2 '’Pk 2 ki ^ I®’ verify the non-negativity and the normalization 
axioms. Finally, pignistic probabilities are computed from these mass functions in 
order to decide about sources independence Id such that: 


Id(‘5'i,‘S'2) = BetP(/nd) 
=BetP(Dep) 


(28) 


If Id(‘S'i,i' 2 ) > Id(‘5'i,‘S'2) we claim that sources and S 2 are independent otherwise 
they are dependent. 


45. General case 

The method detailed above estimates the independence of one source on an¬ 
other one. Independence measure is non-symmetric because if a source is inde¬ 
pendent on a source S 2 then S 2 is not necessarily independent on and even if it is 
the case, degrees of independence are not necessarily the same. 

It is wise to choose the minimum independence from Id(‘yi ,■^ 2 ) and Id(52,‘^i) as 
the overall independence. Consequently, if at least one of two sources is dependent 
on the other, then sources are considered dependent. In other words, two sources 
are independent only if they are mutually independent. Hence, overall indepen¬ 
dence that is denoted I{si,S 2 ) is given by: 

7 ( 51 , 52 ) =mm(Id(5i,52),Id(52,5i)) (29) 

We note that 7 ( 51 , 52 ) is non-negative, normalized, symmetric and identical. 

We define an independence measure, noted 7, generalizing the independence for 
more than two sources verifying the following axioms: 
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1. Non-negativity: Many sources independence { 51 , 52 , 53 ,... noted 

Id(‘5'i,52,..., Sns) cannot be negative, it is either positive or null. 

2. Normalization: Sources independence 7 is a degree in [0,1]. The minimum 
0 is reached when sources are completely dependent and the maximum 1 is 
reached when they are completely independent. 

3. Symmetry: /(51,52,53,... , 5 „,s) is the sources’ overall independence and 

7(51, 52 , 53 ,..., 5„,) = 7 ( 52 , 51 , 53 ,...,5„,) = 7 ( 53 , 51 , 52 ,...,5„,). 

4. Identity: 7(5i,5i,5i) = 0. It is obvious that any source is completely depen¬ 
dent on itself. 

5. Increasing with inclusion: 7 ( 51 , 52 ) < 7 ( 51 , 52 , 53 ), more there are sources, 
more they are likely to be independent. 


To compute the overall independence of ns sources { 51 ^ 2 , ■. ■ indepen¬ 
dencies of pairs of sources are computed and the maximurrG independence is the 
sources overall independence: 


I{si,S 2 ,...,Sns) =max{I{si,Sj)), V/G[l,n5] J €]i,ns] (30) 


or equivalently: 


7(51,52 ^ns ) =max{min{Id{si,Sj),Id{sj,Si))), \/i,je[l,ns]iy^j (31) 


Independence degree of sources is then integrated in the combination step using 
the following mixed combination rule. 


5. Combination rule 


Combination rules using conjunctive and/or disjunctive rules such as y, 


6 , 


71,1^ are used when sources are completely independent but cautious and bold 


■^The maximum is used to insure the property of increasing with inclusion. 
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rules lid] tolerate redundant information and consequently can be used to combine 
mass functions which sources are dependent. In the combination step, sources de¬ 
pendence or independence hypothesis is intuitively made without any possibility 
of check. Sources independence degree is neither 0 nor 1 but a level over [0,1]. 
The main question is “which combination rule to use when combining partially 
independent\dependent mass functions?” 

In this paper, we propose a new mixed combination rule using conjunctive and 
cautious rules detailed in equations ([8]) and ([T^ . In the case of totally dependent 
sources (where independence is 0), the cautious and proposed mixed combination 
rules are similar; whereas in the case of totally independent sources (independence 
is 1), the conjunctive and proposed combination rules are similar. In the case of an 
independence degree in ]0,1[, combined mass function is the average of conjunc¬ 
tive and cautious combinations weighted by sources’ independence degree. 

Assume that two sources and $2 are independent with a degree y such that 
7 = I{s\,S 2 )', nil and m 2 are mass functions provided by and S 2 - The proposed 
mixed combination rule is defined as follows: 

niMixed{A) = y*m(^{A) -f (1 - 7 ) * m@{A), VA C (32) 

The degree of independence of a set of sources is given by equation (l30l) . and 
the mixed combination of a set of mass functions {mi,m 2 ,... ,m„i} provided by 
sources {^i ,^ 2 ,... is also a weighted average such that: 

y = I{si,S 2 ,...,Sns) (33) 

Properties of the proposed mixed combination rule: 

• Commutativity: Conjunctive and cautious rules are commutative. Indepen¬ 
dence measure is symmetric because sources’ degree of independence is the 
same for a set of sources. Then the proposed rule is commutative. 
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• Associativity: Conjunctive and cautious rule are associative but the proposed 
rule is not because independence degree of n sources and n + \ ones is not 
necessarily the same. 

• Idempotent: Degree of independence of one source to itself is 0, in that case 
the proposed rule is equivalent to the cautious rule. As the cautious rule is 
idempotent, it is the case of the proposed mixed rule. 

• Neutral element: Mixed combination rule does not have any neutral element. 

• Absorbing element: No absorbing element also. 

Example. Assume a frame of discernment Q. = {a,b,c} and two sources and S 2 
providing two mass functions m\ and m 2 . Tabled] illustrates conjunctive and cau¬ 
tious combinations as well as mixed combination in the cases where 7 = 0, 7 = 0.3, 
7 = 0.6 and 7 = 1 . When 7 = 0, mixed and cautious combinations are equivalent; 
when 7 = 1 , mixed and conjunctive combinations are equivalent, otherwise it is a 
weighted average by 7 €] 0 , 1 [. 

Finally, to illustrate the proposed mixed combination rule and compare it to 
other combination rules, three mass functions are generated randomly using algo¬ 
rithm m These mass functions are combined with conjunctive, Dempster, Yager, 
disjunctive, cautious and mean combination rules. They are also combined with 
the mixed combination rule with different independence levels. 

Figure [3] illustrates distancebetween the mixed combination with several de¬ 
grees of independence and combined mass functions using conjunctive, Dempster, 
Yager, disjunctive, cautious and mean combination rules. Distances between mixed 
combination with several independence degrees; and Yager, disjunctive, mean and 


^Jousselme distance detailed in equation l ll6t . 
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Dempster’s rules are linear and deereasing proportionally to 7 . 


Table 1: Combination of two mass functions 


2^ 

mi 

m 2 



tflMixed 

7 = 0 

mMixed 

7 = 0.3 

tflMixed 

7 = 0.6 

tflMixed 

7= 1 

0 

0 

0 

0.1071 

0.06 

0.1071 

0.093 

0.0789 

0.06 

a 

0.3 

0.3 

0.2679 

0.45 

0.2679 

0.3225 

0.3771 

0.45 

b 

0 

0 

0 

0 

0 

0 

0 

0 

aUb 

0 

0 

0 

0 

0 

0 

0 

0 

c 

0.2 

0 

0.1786 

0.14 

0.1786 

0.167 

0.1554 

0.14 

aUc 

0.2 

0.4 

0.2551 

0.26 

0.2551 

0.2566 

0.2580 

0.26 

bUc 

0 

0 

0 

0 

0 

0 

0 

0 

aUbUc 

0.3 

0.3 

0.1913 

0.09 

0.1913 

0.1609 

0.1305 

0.09 


6. Experiments 

Beeause of the laek of real evidential data, we use generated mass funetions to 
test the method detailed above. Moreover, it is diffieult to simulate all situations 
with all possible eombinations of foeal elements for several degrees of indepen- 
denee between sourees. First, we generate two sets of mass funetions for two 
sourees and S 2 ', then we illustrate for three sourees. 

6.1. Generated data depiction 

Generating sets of n mass funetions for several sourees depends on sourees 
independenee. We diseern eases of independent and dependent sourees. 
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Distances 


- Distances to the Conjunctive combination 



Figure 3: Distances between combined mass functions 


6.1.1. Independent sources 

In general, to generate mass functions some information are needed: the num¬ 
ber of hypotheses in the frame of discernment, | Q. \ and the number of mass func¬ 
tions. We note that number of focal elements, and masses are chosen randomly. 

In the case of independent sources, masses can be anywhere and focal elements 
of both sources are chosen independently. Mass functions of and S 2 are generated 
following algorithm m We note that focal elements, their number and BBMs are 
chosen randomly according to the universal low. 

Algorithm 2 Independent mass functions generating 
Require: |fl|, n : number of mass functions 

1: for / = 1 to n do 

2: Choose randomly | F |, the number of focal elements on [1, |2^|]. 

3: Choose randomly | F \ focal elements noted F. 

4: Divvy the interval [0,1] into |F| continuous sub-intervals. 

5: Focal elements BBMs are intervals sizes. 

6 : end for 

7: return n mass functions 
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6.1.2. Dependent sources 

The case of dependent sources is a bit difficult to simulate as several scenarios 
can occur. In this section, we will try to illustrate the most common situations. 
Generated mass functions for dependent sources are supposed to be consistent and 
do not enclose any internal conflict 1^ . Consistent mass functions contain at least 
one focal element common to all focal sets. Figure |4] illustrates a consistent mass 
function where all focal elements {A, B, C, D} intersect. 



Algorithm |3] generates a set of n consistent mass function^ defined on a frame 
of discernmenf of size | |. In fhe case of dependenf sources, fhey are almost 

consistent and at least one of them is dependent on the other. To simulate the case 
where one source is dependent on another one, consistent mass functions of the 
first one are generated following algorithm |3l then those of the second source are 
generated knowing decisions of the first one. Algorithm |4] generates a set of mass 
functions that are dependent on another set of mass functions. Dependence is due 
to the knowledge of other source’s decisions. 


^Conflict within such mass functions is null. 
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Algorithm 3 Consistent mass functions generating 
Require: |fl|, n : number of mass functions 

1 : for / = 1 to n do 

2 : Choose randomly a focal set ft), (it can be a single point) from fl. 

3: Find the set S of all focal sets including ft),. 

4: Choose randomly | F \, the number of focal elements on [1, |S|]. 

5: Choose randomly | F \ focal elements from S noted F. 

6 : Divvy the interval [0,1] into |F| continuous sub-intervals. 

7: BBMs of focal elements are intervals sizes. 

8 : end for 

9: return n consistent mass functions 
6.2. Results of tests 

Algorithms detailed in the previous section are used to test some cases of 
sources’ dependence and independence. We note that in extreme cases where mass 
functions are certain or even when focal elements do not intersect; maximal values 
of independence are obtained. In the case of perfect dependence; mass functions 
have the same focal elements; however, clusters contain mass functions with con¬ 
sistent focal elements. Clustering is performed according to focal elements and 
clusters are perfectly linked. 

6.2.1. Independent sources 

In this paragraph, mass functions are independent. Focal elements and BBMs 
are randomly chosen ensuing algorithm |2l For tests, we choose | | = 5 which is 

considered as medium-sized frame of discernment and n = 100. Table [2] illustrates 
the mean of 100 tests in the case of independent sources. The mean of 100 tests for 
two dependent sources yields to a degree of independence 7 = 0 . 68 , thus sources 
are independent. Assume that ni\ and m 2 , given in tabled are provided by two 
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Algorithm 4 Dependent mass functions generating 

Require: |D|, n : number of mass functions, d decision of another source 

1: for / = 1 to n do 

2: Find the set S of all focal sets including d. 

3: Choose randomly | F |, the number of focal elements on [1, iS'j]. 

4: Choose randomly | F \ focal elements from S noted F. 

5: Divvy the interval [0,1] into |F| continuous sub-intervals. 

6: Focal elements BBMs are intervals sizes. 

7: end for 

8: return n consistent mass functions 


sources and S 2 which independence degree is given in table |2] Combination of 
m\ and m 2 is given in table [3] 

To illustrate the case of three independent sources, three sets of 100 indepen¬ 
dent mass functions are generated following algorithmic] with | D | = 5. The mean 
of 100 tests are illustrated in table ID 


Table 2: Mean of 100 tests on 100 generated mass functions for two sources 


Dependence type 

Degree of independence 

Overall independence 

Independence 

Id{s\,S2) =0.68, rd{si,S2) =0.32 

7 = 0.68 

Id{s2,s\) =0.68, rd{s2,s\) =0.32 

Dependence 

Id{suS2) =0.34, rd{si,S2) =0.66 

7 = 0.34 

Id{s2,s\) =0.35, rd{s2,si) =0.65 


6.2.2. Dependent sources 

In the case of dependent sources, mass functions are generated ensuing algo¬ 
rithms [3] and |4| For tests, we choose | D | = 5 and n = 100. We generate 100 mass 
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Table 3: Mixed combination of m\ and m 2 


20 . 

m\ 

m 2 

f^Mixed 

7 = 0.68 

mMixed 

7 = 0.34 

0 

0 

0 

0.092 

0.076 

a 

0.3 

0.3 

0.3262 

0.3881 

b 

0 

0 

0 

0 

aLib 

0 

0 

0 

0 

c 

0.2 

0 

0.1662 

0.1531 

aUc 

0.2 

0.4 

0.2567 

0.2583 

bVJc 

0 

0 

0 

0 

aUbVJc 

0.3 

0.3 

0.1589 

0.1244 


functions of both and $2 for 100 times and then compute the average of Id (^i, ^ 2 ), 
Id(‘^ 2 )‘^i) and I{s 1 , 82 ). Table H] illustrates the mean of 100 independence degrees 
of two dependent sources providing each one 100 randomly generated mass func¬ 
tions. These sources are dependent with a degree 1 — 7 = 0.66. In table |3l mi and 
m 2 are combined using the mixed rule when 7 = 0.34. 

To illustrate the case of three dependent sources, three sets of 100 dependent mass 
functions are generated following algorithms |3] and |4] when | |= 5. The mean of 

100 degrees of independence are illustrated in table |5] 

Finally, assume that mi, m 2 and m 3 of tableare three mass functions defined 
on a frame of discernment Q. = {a,b,c} and provided by three dependent sources. 
The mixed combined mass function when their degree of independence is 7 = 0.35 
is also given in tabled 


29 








Table 4: Mean of 100 tests on 100 generated mass functions for three independent sources 


Sources 

Degree of independence 

Pairwise 

independence 

Overall 

independence 

51-52 

ld{si,S2) =0.67,4/(51,52) =0.33 

7 ( 51 , 52 ) = 0.67 

7 = 0.68 

ld{s 2 ,si) = 0.67, Td{s 2 ,si) = 0.33 

‘5'1-53 

h{si,S2) =0.68, 4t(‘Si,53) = 0.32 

7 ( 51 , 53 ) =0.68 

ld{s2,si) =0.68, 4t(‘5'3Ai) = 0.32 

52-53 

ld{s2,S2) = 0.68, Td{s2-,S2) = 0.32 

I {^ 2 ,S 3 ) =0.68 

Id{s2,S2) = 0.68, Td{sj,,S2) = 0.32 


7, Conclusion 

In this paper, we proposed a method to learn sourees eognitive independenee in 
order to use the appropriate eombination rule either when sourees are eognitively 
dependent or independent. Sourees are eognitively independent if they are differ¬ 
ent; not eommunieating and they have distinet evidential eorpora. The proposed 
statistieal approaeh is based on a elustering algorithm applied to mass funetions 
provided by several sourees. A pair of sourees independenee is dedueed from 
weights of linked elusters after a matehing of their elusters. Independenee degree 
of sourees ean either guide the ehoiee of the eombination rule if it is either 1 or 0; 
when it is a degree over ]0,1 [, we propose a new eombination rule that weights the 
eonjunetive and eautious eombinations with sourees’ independenee degree. 
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